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<400> 1 

acgagtgctt ctcttattat gcgagctgtt tattccaaag tatgttcgca attttcgact 60 
cctgctaaca taacgcacgg ttaaagcagg aacatttggg cctataagcc caaaatttca 120 
ttagcttaat acgatgctcc gaagtgttat tgcatttgca catatacata aaattgtgac 
atagaatagg agaattccac atacaaatac aaaaatacaa aatcctccag taaaatttaa 
aacgatatcg tgttttgctt cgcgtatctc acgtgagatg taatcgcatg catatgagtg 300 
gtgagtgcct gcgtgcagtt cctggtctaa atatgcttaa ttgcgttcgc cgacttcaaa 360 
agcaataaaa cgatggattt taattgctac ttgagcaatt agccacacaa gggatcttgg 420 



gaaggtcgat ttgaaggaat tcgatttcta ggatgctctc gacaaca atg ccc cgc 

Met Pro Arg 



180 
240 



476 



agt cca acc caa caa cag ccg caa cca aac tec gat gec tec tea aca 524 
Ser Pro Thr Gin Gin Gin Pro Gin Pro Asn Ser Asp Ala Ser Ser Thr 
5 10 15 

agt gca tct gga tea aat cct gga gca gcg ate gga aat ggg gac teg 572 
Ser Ala Ser Gly Ser Asn Pro Gly Ala Ala lie Gly Asn Gly Asp Ser 
20 25 30 35 

gcg gcg age aga agt tct ccg aag acc ctt aat age gaa ccc ttt tct 620 
Ala Ala Ser Arg Ser Ser Pro Lys Thr Leu Asn Ser Glu Pro Phe Ser 
40 45 50 

act ttg teg ccg ggtaagactt gtattgattt ctctttgtcc ggaattataa 672 
Thr Leu Ser Pro 
55 

caactttctg tgtttcca gat caa ata aaa ttg acg cca gaa gaa ggc act 723 

Asp Gin lie Lys Leu Thr Pro Glu Glu Gly Thr 
60 65 

gag aaa age gga eta tea act agt gat aaa get gee act gga gga gec 771 
Glu Lys Ser Gly Leu Ser Thr Ser Asp Lys Ala Ala Thr Gly Gly Ala 
70 75 80 

cca ggc agt gga aat aat ctg ccc gag gga caa act atg eta agg cag 819 
Pro Gly Ser Gly Asn Asn Leu Pro Glu Gly Gin Thr Met Leu Arg Gin 
85 90 95 

aac tct acg age aca ate aac teg tgc eta gtc get tct cca caa aac 867 
Asn Ser Thr Ser Thr lie Asn Ser Cys Leu Val Ala Ser Pro Gin Asn 
100 105 110 



tec agt gaa cac teg aat age age aat gtg tct get aca gtg ggc ctt 915 

Ser Ser Glu His Ser Asn Ser Ser Asn Val Ser Ala Thr Val Gly Leu 

115 120 125 130 

act cag atg gta gat tgt gac gag caa teg aag aaa aac aaa tgt agt 963 

Thr Gin Met Val Asp Cys Asp Glu Gin Ser Lys Lys Asn Lys Cys Ser 
135 140 145 

gtg aag gac gag gaa get ggtaagactg ccctacaaat ggtttaaaat 1011 
Val Lys Asp Glu Glu Ala 
150 



tttaaaatgt 


attggcgttc 


acctttgtta 


atcatttaat 


tgtttttttt 


ttgetatact 


1071 


tacaatttta 


gttttaaact 


tgtaaacttg 


actaaaactc 


gegaagcteg 


gatcaaaaca 


1131 


gacattttct 


tggaaccgta 


attaagctca 


taaaaatatt 


aattcatctt 


gatggaatgc 


1191 


atatcataga 


tgtactcaaa 


catctcaaga 


aagacctcaa 


attggatcaa 


ctaattagtt 


1251 


tgagaaaaaa 


ttgctgtact 


tttaagaata 


tattaattta 


aaaatttget 


gagtgaaatg 


1311 


atataatagt 


cacaataatt 


tttagttaaa 


ctgetaaage 


attttgaata 


gccgtgctac 


1371 


geagatgeta 


etagacgegg 


tgtaaaagct 


aatttttatt 


taaaagctgt 


cctaatattc 


1431 


cataaccatt 


aatgtcccat 


ttca gaa ata agt tct aat aaa gca 
Glu lie Ser Ser Asn Lys Ala 
155 


aaa ggt 
Lys Gly 
160 


1482 



caa gca get ggt ggc ggc tgc gaa aca ggt tct aca tec agt ttg act 1530 
Gin Ala Ala Gly Gly Gly Cys Glu Thr Gly Ser Thr Ser Ser Leu Thr 
165 170 175 



gtc aag gaa gaa ccc acc gat gtc tta ggc agt tta gta aat atg aaa 
Val Lys Glu Glu Pro Thr Asp Val Leu Gly Ser Leu Val Asn Met Lys 
180 185 190 



1578 



aaa gaa gaa aga gaa aat cat teg cca acg atg tec cct gtt ggt ttt 1626 

Lys Glu Glu Arg Glu Asn His Ser Pro Thr Met Ser Pro Val Gly Phe 

195 200 205 

ggt tea att ggt aat gca cag gac aac tec get aca ccg ggtaagtttt 1675 

Gly Ser lie Gly Asn Ala Gin Asp Asn Ser Ala Thr Pro 



210 


215 


220 








aagagatcca 


tataaagcaa 


ataacaagaa 


ttaatgtcag 


ttaccaattt 


tatttgatag 


1735 


tcaaagaact 


actatagega 


tatctcctgc 


cttttaattt 


tattttaatt 


aggaaatacg 


1795 


aatatttcta 


atttgtaaaa 


taaaattgat 


taattaacta 


gaatttaaaa 


accttttgaa 


1855 


ttaggacata 


cccttccaaa 


aatcagtaat 


cattgggaac 


gagagtgtgg 


tcccgaagga 


1915 


gactactata 


aaaccttttg 


agctatctga 


tactgcacgc 


tactaaaaat 


gattagttta 


1975 



ggaaaatggg tgtaattttg taggaagttt tcattttaga agaaatgtga ttattttatt 2035 

aaaccccttc aagcggaact acatttgttc tacgatattt tggaaaaaca aatggttaag 2095 

ttggaaagtg cctataaaac agaattccac ggtttcaaat actaaccagg tttttgattt 2155 

aattttgatt aaatgagaaa ttatcacact tcagttaaaa tgtttaattc gattaaggtc 2215 

ggacaatcac agcagatttc catttttgcg tgtatatata gaagtcgcct tcacactctt 2275 

ctggcgcgct tcaccactac gtggagttcc gcccgcagtg atttatatag atgatttacg 2335 

agttatttaa ttttttatgg tgtattttaa taaatatctt atttattcat tttacata 2393 

gtt aaa att gaa aga att tea aac gac agt acc acg gaa aaa aaa gga 2441 
Val Lys He Glu Arg He Ser Asn Asp Ser Thr Thr Glu Lys Lys Gly 
225 230 235 

teg tec ttg aca atg aat aat gac gaa atg age atg gaa ggc tgc aat 2489 
Ser Ser Leu Thr Met Asn Asn Asp Glu Met Ser Met Glu Gly Cys Asn 
240 245 250 

cag ttg aat ccc gat ttt ate aat gaa tct tta aat aat cct gca att 2537 
Gin Leu Asn Pro Asp Phe lie Asn Glu Ser Leu Asn Asn Pro Ala He 
255 260 265 270 

teg age ata tta gta age gga gta gga cca ata ccc gga ate gga gtt 2585 
Ser Ser He Leu Val Ser Gly Val Gly Pro He Pro Gly He Gly Val 
275 280 285 

gga gcg ggg acg gga aat tta ttg act gcc aac gcc aat gga ate tec 2633 
Gly Ala Gly Thr Gly Asn Leu Leu Thr Ala Asn Ala Asn Gly He Ser 
290 295 300 

teg ggt age agt aat tgt ttg gat tac atg caa cag caa aat cac ata 2681 
Ser Gly Ser Ser Asn Cys Leu Asp Tyr Met Gin Gin Gin Asn His He 
305 310 315 

ttc gtg ttt tea act cag ctg gcc aac aaa ggg gcc gaa tea gtt tta 2729 
Phe Val Phe Ser Thr Gin Leu Ala Asn Lys Gly Ala Glu Ser Val Leu 
320 325 330 

age ggt caa ttt caa act att att gcg tat cac tgc act cag cct get 2777 
Ser Gly Gin Phe Gin Thr He He Ala Tyr His Cys Thr Gin Pro Ala 
335 340 345 350 

aca aaa age ttc ctg gaa gac ttt ttt atg aaa aac cct tta aag att 2825 
Thr Lys Ser Phe Leu Glu Asp Phe Phe Met Lys Asn Pro Leu Lys He 
355 360 365 

aac aag tta cag egg cac aat tec gtc ggt atg cca tgg ata ggc atg 2873 
Asn Lys Leu Gin Arg His Asn Ser Val Gly Met Pro Trp He Gly Met 
370 375 380 



ggg cag gtt gga eta act cct cct aat cct gta gcc aaa ata aca caa 
Gly Gin Val Gly Leu Thr Pro Pro Asn Pro Val Ala Lys He Thr Gin 
385 390 395 



2921 



cag cag cca 
Gin Gin Pro 
400 

caa cat gaa 
Gin His Glu 
415 

ttt gtc gac 
Phe Val Asp 



tgg gaa ggc 
Trp Glu Gly 



aat cat gta 
Asn His Val 
465 

ata ctg gaa 
lie Leu Glu 
480 

gat cct ggc 
Asp Pro Gly 
495 

aag gtt cca 
Lys Val Pro 



gaa cag ttg 
Glu Gin Leu 



aat gag aat 
Asn Glu Asn 
545 

cca gga gat 
Pro Gly Asp 
560 

ata aat ccg 
lie Asn Pro 

575 

gag etc tta 
Glu Leu Leu 



cca ggg gat 
Pro Gly Asp 



cat aca aag 
His Thr Lys 



aac age aaa 
Asn Ser Lys 
420 

cag tct gat 
Gin Ser Asp 
435 

gga tec tea 
Gly Ser Ser 
450 

gac agt ate 

Asp Ser lie 



gca get ggc 
Ala Ala Gly 



ctg aca act 
Leu Thr Thr 
500 

gac gaa aac 
Asp Glu Asn 
515 

gca aaa ata 
Ala Lys lie 
530 

tea gta gga 
Ser Val Gly 



tta atg atg 
Leu Met Met 



acg atg cga 
Thr Met Arg 
580 

teg gcg aca 
Ser Ala Thr 
595 

gtt ata tea 
Val lie Ser 
610 



ace gta ggc 
Thr Val Gly 
405 

cgt agt act 
Arg Ser Thr 



cct atg ggc 
Pro Met Gly 



aac ace agt 
Asn Thr Ser 
455 

agt aca tec 
Ser Thr Ser 
470 

gtt gat ttg 
Val Asp Leu 
485 

gaa aac aac 
Glu Asn Asn 



ctt aca cca 
Leu Thr Pro 



aaa aaa atg 
Lys Lys Met 
535 

get aat gta 
Ala Asn Val 
550 

ggg atg teg 
Gly Met Ser 
565 

caa ctg cat 
Gin Leu His 



agt tea gga 
Ser Ser Gly 



gat atg ggt 
Asp Met Gly 
615 



eta ttg aaa 
Leu Leu Lys 
410 

gta age gcg 
Val Ser Ala 
425 

aac gaa act 
Asn Glu Thr 
440 

agg tct gga 
Arg Ser Gly 



age gag tea 
Ser Glu Ser 



gga cag gtc 
Gly Gin Val 
490 

att gta tea 
lie Val Ser 
505 

caa cag egg 
Gin Gin Arg 
520 

aat caa ttt 
Asn Gin Phe 



age tea cag 
Ser Ser Gin 



ggt ggc gga 
Gly Gly Gly 
570 

atg cca ggt 
Met Pro Gly 
585 

ctt teg gaa 
Leu Ser Glu 
600 

gee gta ata 
Ala Val lie 



ccc caa ttc 
Pro Gin Phe 



cct age aac 
Pro Ser Asn 



gaa ttg atg 
Glu Leu Met 
445 

caa aac tea 
Gin Asn Ser 
460 

cag gca ata 
Gin Ala He 
475 

aca aaa gga 
Thr Lys Gly 



ctg caa gga 
Leu Gin Gly 



caa cat egg 
Gin His Arg 
525 

ctt ttt cct 
Leu Phe Pro 
540 

ata aca aaa 
He Thr Lys 
555 

ggc gga tct 
Gly Gly Ser 



aac gee aaa 
Asn Ala Lys 



gat gta atg 
Asp Val Met 
605 

gga tgt aat 

Gly Cys Asn 
620 



aat 2969 
Asn 



tct 3017 

Ser 

430 

tgc 3065 
Cys 



cga 3113 
Arg 



aag 3161 
Lys 



age 3209 
Ser 



gtt 3257 

Val 

510 

gaa 3305 
Glu 



gaa 3353 
Glu 



att 3401 
He 



att 3449 
He 



teg 3497 

Ser 

590 

cat 3545 
His 



aat 3593 
Asn 



aat caa aaa 
Asn Gin Lys 
625 

gga aca act 
Gly Thr Thr 
640 

ggc gcc ccg 
Gly Ala Pro 
655 

teg ttt ggc 
Ser Phe Gly 



tct aag gaa 
Ser Lys Glu 



gga gtt get 
Gly Val Ala 
705 

gaa cgc etc 
Glu Arg Leu 
720 

cca caa cag 
Pro Gin Gin 
735 

cag gtg cga 
Gin Val Arg 



aga tct gcg 
Arg Ser Ala 



cca aac aat 

Pro Asn Asn 
785 

gga ttg ccg 
Gly Leu Pro 
800 

gga tct gtt 
Gly Ser Val 
815 

aca gtg etc 
Thr Val Leu 



ccg tea aat 



acc agt gtg 
Thr Ser Val 



gca get gga 
Ala Ala Gly 



aat ggc aat 
Asn Gly Asn 
660 

aac aca age 
Asn Thr Ser 
675 

gtt tta aat 
Val Leu Asn 
690 

caa atg gag 
Gin Met Glu 



aag ggg ggc 
Lys Gly Gly 



caa acc cct 
Gin Thr Pro 
740 

ccc ctg caa 
Pro Leu Gin 
755 

tea gta cca 
Ser Val Pro 
770 

eta tct etc 
Leu Ser Leu 



acc aac tct 
Thr Asn Ser 



ccg caa get 
Pro Gin Ala 
820 

tea gca aac 
Ser Ala Asn 
835 

caa aat cgt 



caa tgt gga 
Gin Cys Gly 
630 

gta aat gtc 
Val Asn Val 
645 

atg atg gga 
Met Met Gly 



tgc aac gtc 
Cys Asn Val 



caa gat age 
Gin Asp Ser 
695 

tgg teg aag 
Trp Ser Lys 
710 

aag ccc aga 
Lys Pro Arg 
725 

tct gga tct 
Ser Gly Ser 



ggt cca cct 
Gly Pro Pro 



ata gcc act 
lie Ala Thr 

775 

ccg tea ccg 
Pro Ser Pro 
790 

cct age atg 
Pro Ser Met 
805 

aat act teg 
Asn Thr Ser 



aag aac tgt 

Lys Asn Cys 



agt aga aat 



tct gga gta 
Ser Gly Val 



aat atg cat 
Asn Met His 
650 

age tct acg 
Ser Ser Thr 
665 

ate gga acg 
lie Gly Thr 
680 

cga acc cat 
Arg Thr His 



att caa cat 
lie Gin His 



caa gtc act 
Gin Val Thr 
730 

ggt gga aac 
Gly Gly Asn 
745 

cct cct tac 
Pro Pro Tyr 
760 

caa teg ccc 
Gin Ser Pro 



egg aca acc 
Arg Thr Thr 



gat gga aca 
Asp Gly Thr 
810 

acg gtt cag 
Thr Val Gin 
825 

ttt cag gca 
Phe Gin Ala 
840 

acc gga teg 



ggt gtt gtc 
Gly Val Val 
635 

tgc tea age 
Cys Ser Ser 



gat atg eta 
Asp Met Leu 



gcc cca gat 
Ala Pro Asp 
685 

tea cat caa 
Ser His Gin 
700 

caa ttt ttc 
Gin Phe Phe 
715 

gga act gta 
Gly Thr Val 



teg tta aac 
Ser Leu Asn 



cac tec ate 
His Ser lie 
765 

aat ccc teg 
Asn Pro Ser 
780 

gca gca gtc 
Ala Ala Val 
795 

gga tea tta 
Gly Ser Leu 



gca ggc aca 
Ala Gly Thr 



gac acc cca 
Asp Thr Pro 
845 

tea age gtt 



act 3641 
Thr 



tec 3689 
Ser 



gcc 3737 

Ala 

670 

atg 3785 
Met 



ggg 3 8 3 3 

Gly 



gaa 3881 
Glu 



gta 3929 
Val 



aac 3977 

Asn 

750 

cag 4025 
Gin 



agt 4073 
Ser 



atg 4121 
Met 



tct 4169 
Ser 



aca 4217 

Thr 

830 

teg 4265 
Ser 



ctt 4313 



Pro Ser Asn Gin Asn Arg Ser Arg Asn Thr Gly Ser Ser Ser Val Leu 
850 855 860 



acg cat aac tta age age aac cca agt acc ccc tta tct cat eta tec 
Thr His Asn Leu Ser Ser Asn Pro Ser Thr Pro Leu Ser His Leu Ser 
865 870 875 



4361 



cca aag gaa ttt gag tct ttc ggt cag tec tct get ggtatgttat 4407 
Pro Lys Glu Phe Glu Ser Phe Gly Gin Ser Ser Ala 
880 885 890 

atttgtttaa tttttttaaa gacaaatcaa atatgaattg cgttaataat aagttatata 4467 

ttacataact eggaaatttg atagaaaaaa tcaggaatag aaaaaataaa ttattttccg 4527 

gaccgcccat ccatttcttg aatccaattt ctggagtgat tgttagagat aatctactat 4587 

taaaattaaa cacgaaaatt catatcegtt aattgaaaat cactattgtt taataagaaa 4647 

ttaaaaatat gtttattata atatttctac a ggt gat aac atg aaa agt agg 4699 

Gly Asp Asn Met Lys Ser Arg 
895 



cga cca age cca cag ggt cag egg tea cca gta aat agt eta ata gag 
Arg Pro Ser Pro Gin Gly Gin Arg Ser Pro Val Asn Ser Leu lie Glu 
900 905 910 



atg ggc tct acc aat ata cag atg gag gtaaatattt aaatatttta 
Met Gly Ser Thr Asn lie Gin Met Glu 
950 



4747 



gca aat aaa gat gta cga ttt get gca tec agt cct ggt ttt aac ccg 4795 

Ala Asn Lys Asp Val Arg Phe Ala Ala Ser Ser Pro Gly Phe Asn Pro 

915 920 925 

cat cca cat atg caa age aat tea aat tea gca tta aac gee tat aaa 4843 

His Pro His Met Gin Ser Asn Ser Asn Ser Ala Leu Asn Ala Tyr Lys 

930 935 940 945 



4890 



tttaacgttt ttgtgttaat ttatcttctt tttcag cgt caa gca tea gcg caa 4944 

Arg Gin Ala Ser Ala Gin 
955 960 

ggt gga tec gta caa ttt agt egg cgc tec gat aat att ccg eta aat 4992 
Gly Gly Ser Val Gin Phe Ser Arg Arg Ser Asp Asn lie Pro Leu Asn 
965 970 975 

ccc aat agt ggc aat egg ccg cca cca aac aag atg acc caa aac ttc 504 0 

Pro Asn Ser Gly Asn Arg Pro Pro Pro Asn Lys Met Thr Gin Asn Phe 
980 985 990 

gat cca ate tct tct ttg gca caa atg tec caa caa eta aca agt tgc 5088 
Asp Pro lie Ser Ser Leu Ala Gin Met Ser Gin Gin Leu Thr Ser Cys 
995 1000 1005 

gtg tec age atg ggt agt cca gee gga act ggt ggt atg acg atg 5133 
Val Ser Ser Met Gly Ser Pro Ala Gly Thr Gly Gly Met Thr Met 



1010 



1015 



1020 



atg ggg ggt ccg gga ccg tec gac ate aat att gag 
Met Gly Gly Pro Gly Pro Ser Asp lie Asn lie Glu 
1025 1030 1035 



cat gga ata 
His Gly lie 



5178 



att teg gga eta gat gga tea gga ata gat acc ata aat caa aat 
He ser Gly Leu Asp Gly Ser Gly lie Asp Thr lie Asn Gin Asn 
1040 1045 1050 



5223 



aac tgt cat tea atg aat gtc gta atg aac tea atg 
Asn Cys His Ser Met Asn Val Val Met Asn Ser Met 
1055 1060 1065 



ggt ccc cga 
Gly Pro Arg 



5268 



atg ctg aat cct aaa atg tgc gta gca ggc ggt cca aat gga ccg 
Met Leu Asn Pro Lys Met Cys Val Ala Gly Gly Pro Asn Gly Pro 
1070 1075 1080 



5313 



cct ggc ttt aat cct aat tec ccc aat ggt gga tta 
Pro Gly Phe Asn Pro Asn Ser Pro Asn Gly Gly Leu 
1085 1090 1095 



aga gag aat 
Arg Glu Asn 



5358 



tec ata ggg tct ggc tgt ggc tea gca aac tct tea 
Ser lie Gly Ser Gly Cys Gly Ser Ala Asn Ser Ser 
1100 1105 1110 



aac ttt caa 
Asn Phe Gin 



5403 



ggg gtt gtt cca cct ggt gee 
Gly Val Val Pro Pro Gly Ala 
1115 1120 



aga atg atg ggt cga 
Arg Met Met Gly Arg 
1125 



atg cca gtc 
Met Pro Val 



5448 



aat ttt ggt teg aat ttc aat 
Asn Phe Gly Ser Asn Phe Asn 
1130 1135 



ccg aat att cag gta 
Pro Asn lie Gin Val 
1140 



aag gcg agt 

Lys Ala Ser 



5493 



acc cca aac acc ata caa tac 
Thr Pro Asn Thr lie Gin Tyr 
1145 1150 



atg cca gta agg gca 
Met Pro Val Arg Ala 
1155 



cag aac gec 
Gin Asn Ala 



5538 



aac aac aat aac aac aat gga get aat aat gtg cga atg cca cct 
Asn Asn Asn Asn Asn Asn Gly Ala Asn Asn Val Arg Met Pro Pro 
1160 1165 1170 



5583 



agt ctg gaa ttt ttg cag agg tac get aac cct caa 
Ser Leu Glu Phe Leu Gin Arg Tyr Ala Asn Pro Gin 
1175 1180 1185 



atg ggt get 
Met Gly Ala 



5628 



gta ggc aat ggg teg cca ata 
Val Gly Asn Gly Ser Pro lie 
1190 1195 



tgc cca cca tea gee age gac ggt 
Cys Pro Pro Ser Ala Ser Asp Gly 
1200 



5673 



act cct gga atg cca gga ttg 
Thr Pro Gly Met Pro Gly Leu 
1205 1210 



atg gcg gga cca gga 
Met Ala Gly Pro Gly 
1215 



gee gga ggt 
Ala Gly Gly 



5718 



atg eta atg aat tct tec gga gag caa cac cag aac aag ate aca 
Met Leu Met Asn Ser Ser Gly Glu Gin His Gin Asn Lys lie Thr 
1220 1225 1230 



5763 



aac aat cct ggg gca age aat ggt att aac ttc ttt 
Asn Asn Pro Gly Ala Ser Asn Gly He Asn Phe Phe 
1235 1240 1245 



cag aat tgc 
Gin Asn Cys 



5808 



aat caa atg tct att gtt gac 
Asn Gin Met Ser He Val Asp 
1250 1255 



gaa gag ggt gga tta ccc ggc cat 
Glu Glu Gly Gly Leu Pro Gly His 
1260 



5853 



gac gga tea atg aat att ggt caa cca tct atg ata 
Asp Gly Ser Met Asn He Gly Gin Pro Ser Met He 
1265 1270 1275 



agg ggc atg 
Arg Gly Met 



5898 



cgt cca cat gec atg egg cca aat gta atg ggt gcg egg atg cca 
Arg Pro His Ala Met Arg Pro Asn Val Met Gly Ala Arg Met Pro 
1280 1285 1290 



5943 



ccc gtt aac agg caa att cag ttt gca cag tea teg gat ggt att 
Pro Val Asn Arg Gin lie Gin Phe Ala Gin Ser Ser Asp Gly He 
1295 1300 1305 



5988 



gac tgt gtc ggg gat ccg tea 
Asp Cys Val Gly Asp Pro Ser 
1310 1315 



tea ttt ttc act aac get tec tgc 
Ser Phe Phe Thr Asn Ala Ser Cys 
1320 



6033 



aac age get gga cca cac atg ttt gga tea gca caa 
Asn Ser Ala Gly Pro His Met Phe Gly Ser Ala Gin 
1325 1330 1335 



cag gee aat 
Gin Ala Asn 



6078 



cag cct aag aca caa cac ata 
Gin Pro Lys Thr Gin His He 
1340 1345 



aag aac ata cct agt 
Lys Asn He Pro Ser 
1350 



gga atg tgt 
Gly Met Cys 



6123 



caa aac caa teg gga ctt gca 
Gin Asn Gin Ser Gly Leu Ala 
1355 1360 



gtg gca caa ggg cag 
Val Ala Gin Gly Gin 
1365 



ate caa ctg 
He Gin Leu 



6168 



cat ggg caa gga cat gcg cag 
His Gly Gin Gly His Ala Gin 
1370 1375 



ggt cag tct tta att gga cct act 
Gly Gin Ser Leu He Gly Pro Thr 
1380 



6213 



aat aat aat tta atg tea act gec gga agt gtc agt 
Asn Asn Asn Leu Met Ser Thr Ala Gly Ser Val Ser 
1385 1390 1395 



get act aac 
Ala Thr Asn 



6258 



ggt gtc tct ggc ate aat ttc gta ggt ccc tct tct 
Gly Val Ser Gly He Asn Phe Val Gly Pro Ser Ser 
1400 1405 1410 



acg gac ctg 
Thr Asp Leu 



6303 



aag tat gee cag caa tat cat agt ttt cag cag cag 
Lys Tyr Ala Gin Gin Tyr His Ser Phe Gin Gin Gin 
1415 1420 1425 



tta tat get 
Leu Tyr Ala 



6348 



ace aac acc aga agt caa caa caa cag cat atg cac cag cag cac 
Thr Asn Thr Arg Ser Gin Gin Gin Gin His Met His Gin Gin His 
1430 1435 1440 



6393 



cag age aac atg ata aca atg ccg ccg aat tta tea cca aat cca 6438 
Gin Ser Asn Met lie Thr Met Pro Pro Asn Leu Ser Pro Asn Pro 
1445 1450 1455 

acg ttc ttt gtc aac aaa taaacttcta aatttttgee gccctcgtca 6486 
Thr Phe Phe Val Asn Lys 
1460 

tgtattgttt actagtctcc aaattaagac atgeatctet aaataagatt ttttgaagct 6546 

tatttactta ggtgttttta caaeggagaa aataaacttt tggatatgea aatgataacg 6606 

ttggaaacaa cataattcat ttgcaacttt tagaagtcac gtcgaagtta aatgtagaat 6666 

ctgtatttta acataatagg tcatctgtaa aaataattaa acatcgaaat tttagttatc 6726 

agcagctatt ttctgttatt atttaatatg tgcgctgctc tctctgtgtt aaatgaaatt 6786 

aaaatatata tataaatgta aaaegctatt gatatatatt gctctcaact gtattgtaat 6846 

caatattaag agaactgtaa attcttccat ataaaggtaa tgaaaaaaaa aaaaaaaaaa 6906 

aaa 6909 



<210> 2 
<211> 28 
<2 12> PRT 

<213> Drosophila lgs 
<400> 2 

lie Phe Val Phe Ser Thr Gin Leu Ala Asn Lys Gly Ala Glu Ser Val 
15 10 15 



Leu Ser Gly Gin Phe Gin Thr lie lie Ala Tyr His 
20 25 



<210> 3 
<211> 28 
<212> PRT 

<213> Human lgs/bcl9 
<400> 3 

Val Tyr Val Phe Ser Thr Glu Met Ala Asn Lys Ala Ala Glu Ala Val 
15 10 15 



Leu Lys Gly Gin Val Glu Thr lie Val Ser Phe His 
20 25 



<210> 4 
<211> 35 



<212> PRT 

<213> Drosophila lgs 



<400> 4 

Glu Asn Leu Thr Pro Gin Gin Arg Gin His Arg Glu Glu Gin Leu Ala 
15 10 15 



Lys lie Lys Lys Met Asn Gin Phe Leu Phe Pro Glu Asn Glu Asn Ser 
20 25 30 



Val Gly Ala 
35 



<210> 5 
<211> 35 
<212> PRT 

<213> Human lgs/bcl9 
<400> 5 

Asp Gly Leu Ser Gin Glu Gin Leu Glu His Arg Glu Arg Ser Leu Gin 
15 10 15 



Thr Leu Arg Asp lie Gin Arg Met Leu Phe Pro Asp Glu Lys Glu Phe 
20 25 30 



Thr Gly Ala 
35 



<210> 6 
<211> 15 
<212> PRT 

<2 13 > Drosophila lgs 
<400> 6 

Gin Met Glu Trp Ser Lys lie Gin His Gin Phe Phe Glu Glu Arg 
15 10 15 



<210> 7 

<211> 15 

<212> PRT 

<213> Human lgs/bcl9 

<400> 7 



Gin lie Ala Trp Leu Lys Leu Gin Gin Glu Phe Tyr Glu Glu Lys 
15 10 15 



<210> 8 

<211> 9 

<212> PRT 

<213> Drosophila lgs 

<400> 8 

Leu Gin Gly Pro Pro Pro Pro Tyr His 
1 5 



<210> 9 
<211> 9 
<212> PRT 

<213> Human lgs/bcl9 
<400> 9 

Val Arg Gly Pro Pro Pro Pro Tyr Gin 



<210> 10 
<211> 112 
<212> PRT 

<213> Drosophila lgs 
<400> 10 

Ser Ala Ser Val Pro lie Ala Thr Gin Ser Pro Asn Pro Ser Ser Pro 
15 10 15 



Asn Asn Leu Ser Leu Pro Ser Pro Arg Thr Thr Ala Ala Val Met Gly 
20 25 30 



Leu Pro Thr Asn Ser Pro Ser Met Asp Gly Thr Gly Ser Leu Ser Gly 
35 40 45 



Ser Val Pro Gin Ala Asn Thr Ser Thr Val Gin Ala Gly Thr Thr Thr 
50 55 60 



Val Leu Ser Ala Asn Lys Asn Cys Phe Gin Ala Asp Thr Pro Ser Pro 
65 70 75 80 



Ser Asn Gin Asn Arg Ser Arg Asn Thr Gly Ser Ser Ser Val Leu Thr 
85 90 95 



His Asn Leu Ser Ser Asn Pro Ser Thr Pro Leu Ser His Leu Ser Pro 
100 105 110 



<210> 11 
<211> HI 
<212> PRT 

<213> Human lgs/bcl9 
<400> 11 

Gly Pro Pro Pro Pro Thr Ala Ser Gin Pro Ala Ser Val Asn lie Pro 
15 10 15 



Gly Ser Leu Pro Ser Ser Thr Pro Tyr Thr Met Pro Pro Glu Pro Thr 
20 25 30 



Leu Ser Gin Asn Pro Leu Ser lie Met Met Ser Arg Met Ser Lys Phe 
35 40 45 



Ala Met Pro Ser Ser Thr Pro Leu Tyr His Asp Ala lie Lys Thr Val 
50 55 60 



Ala Ser Ser Asp Asp Asp Ser Pro Pro Ala Arg Ser Pro Asn Leu Pro 
65 70 75 80 



Ser Met Asn Asn Met Pro Gly Met Gly lie Asn Thr Gin Asn Pro Arg 
85 90 95 



lie Ser Gly Pro Asn Pro Val Val Pro Met Pro Thr Leu Ser Pro 
100 105 110 



<210> 


12 


<211> 


16 


<212> 


PRT 


<213> 


Drosophila lgs 


<400> 


12 


Asn Pro Lys Met Cys Val 


1 


5 


<210> 


13 


<211> 


16 


<212> 


PRT 


<213> 


Human lgs/bcl9 


<400> 


13 



Asp Ala Ala Leu Cys Lys Pro Gly Gly Pro Gly Gly Pro Asp Ser Phe 
15 10 15 



<210> 14 
<211> 4281 
<212> DNA 

<213> Human lgs/bcl9 
<400> 14 

atgcattcca gtaaccctaa agtgaggagc tctccatcag gaaacacaca gagtagccct 60 

aagtcaaagc aggaggtgat ggtccgtccc cctacagtga tgtccccatc tggaaacccc 120 

cagctggatt ccaaattctc caatcagggt aaacaggggg gctcagccag ccaatcccag 180 

ccatccccct gtgactccaa gagtgggggc cataccccta aagcactccc tggcccaggt 240 

gggagcatgg ggctgaagaa tggggctgga aatggtgcca agggcaaggg gaaaagggag 300 

cgaagtattt ccgccgactc ctttgatcag agagatcctg ggactccaaa cgatgactct 360 

gacattaaag aatgtaattc tgctgaccac ataaagtccc aggattccca gcacacacca 420 

cactcgatga ccccatcaaa tgctacagcc cccaggtctt ctaccccctc ccatggccaa 480 

actactgcca cagagcccac acctgctcag aagactccag ccaaagtggt gtacgtgttt 540 

tctactgaga tggccaataa agctgcagaa gctgttttga agggccaggt tgaaactatc 600 

gtctctttcc acatccagaa catttctaac aacaagacag agagaagcac agcgcctctg 660 

aacacacaga tatctgccct tcggaatgat ccgaaacctc tcccacaaca gcccccagct 720 

ccggccaacc aggaccagaa ttcttcccag aataccagac tgcagccaac tccacccatt 780 

ccggcaccag cacccaagcc tgccgcaccc ccacgtcccc tggaccggga gagtcctggg 840 

gtagaaaaca aactgattcc ttctgtagga agtcctgcca gctccactcc actgccccca 900 

gatggtactg ggcccaactc aactcccaac aatagggcag tgacccctgt ctcccagggg 960 

agcaatagct cttcagcaga tcccaaagcc cctccgcctc caccagtgtc cagtggcgag 1020 

ccccccacac tgggagagaa tcccgatggc ctatctcagg agcagctgga gcaccgggag 1080 

cgctccttac aaactctcag agatatccag cgcatgcttt ttcctgatga gaaagaattc 1140 

acaggagcac aaagtggggg accgcagcag aatcctgggg tattagatgg gcctcagaaa 1200 

aaaccagaag ggccaataca ggccatgatg gcccaatccc aaagcctagg taagggacct 1260 

gggccccgga cagacgtggg agctccattt ggccctcaag gacatagaga tgtacccttt 1320 

tctccagatg aaatggttcc accttctatg aactcccagt ctgggaccat aggacccgac 1380 

caccttgacc atatgactcc cgagcagata gcgtggctga aactgcagca ggagttttat 1440 

gaagagaaga ggaggaagca ggaacaagtg gttgtccagc agtgttccct ccaggacatg 1500 



atggtccatc agcacgggcc tcggggagtg gtccgaggac ccccccctcc ataccagatg 1560 

acccctagtg aaggctgggc acctgggggt acagagccat tttctgatgg tatcaacatg 1620 

ccacattctc tgcccccgag gggcatggct ccccacccca acatgccagg gagccagatg 1680 

cgcctccctg gatttgcagg catgataaac tctgaaatgg aagggccgaa tgtccccaac 1740 

cctgcatcta gaccaggtct ttctggagtc agttggccag atgatgtgcc aaaaatccca 1800 

gatggtcgaa attttcctcc tggccagggc attttcagcg gtcctggccg aggggaacgc 1860 

ttcccaaacc cccaaggatt gtctgaagag atgtttcagc agcagctggc agagaaacag 192 0 

ctgggtctcc ccccagggat ggccatggaa ggcatcaggc ccagcatgga gatgaacagg 1980 

atgattccag gctcccagcg ccacatggag cctgggaata accccatttt ccctcgaata 2040 

ccagttgagg gccctctgag tccttctagg ggtgactttc caaaaggaat tcccccacag 2100 

atgggccctg gtcgggaact tgagtttggg atggttccta gtgggatgaa gggagatgtc 2160 

aatctaaatg tcaacatggg atccaactct cagatgatac ctcagaagat gagagaggct 2220 

ggggcgggcc ctgaggagat gctgaaatta cgcccaggtg gctcagacat gctgcctgct 2280 

cagcagaaga tggtgccact gccatttggt gagcaccccc agcaggagta tggcatgggc 2340 

cccagaccat tccttcccat gtctcagggt ccaggcagca acagtggctt gcggaatctc 2400 

agagaaccaa ttgggcccga ccagaggact aacagccggc tcagtcatat gccaccacta 2460 

cctctcaacc cttccagtaa ccccaccagc ctcaacacag ctcctccagt tcagcgcggc 2520 

ctggggcgga agcccttgga tatatctgtg gcaggcagcc aggtgcattc cccaggcatt 2580 

aaccctctga agtctcccac gatgcaccaa gtccagtcac caatgctggg ctcgccctcg 2640 

gggaacctca agtcccccca gactccatcg cagctggcag gcatgctggc gggcccagct 2700 

gctgctgctt ccattaagtc cccccctgtt ttggggtctg ctgctgcttc acctgtccac 2760 

ctcaagtctc catcacttcc tgccccgtca cctggatgga cctcttctcc aaaacctccc 2820 

cttcagagtc ctgggatccc tccaaaccat aaagcacccc tcaccatggc ctccccagcc 2880 

atgctgggaa atgtagagtc aggtggcccc ccacctccta cagccagcca gcctgcctct 2940 

gtgaatatcc ctggaagtct tccctctagt acaccttata ccatgcctcc agagccaacc 3000 

ctttcccaga acccactctc tattatgatg tctcgaatgt ccaagtttgc aatgcccagt 3060 

tccaccccgt tataccatga tgctatcaag actgtggcca gctcagatga cgactcccct 3120 

ccagctcgtt ctcccaactt gccatcaatg aataatatgc caggaatggg cattaataca 3180 

cagaatcctc gaatttcagg tccaaacccc gtggttccga tgccaaccct cagcccaatg 3240 



ggaatgaccc 


agccactttc 


tcactccaat 


cagatgccct 


ctccaaatgc 


cgtgggaccc 


3300 


aacatacctc 


ctcatggggt 


cccaatgggg 


cctggcttga 


tgtcacacaa 


tcctatcatg 


3360 


gggcatgggt 


cccaggagcc 


accgatggta 


cctcaaggac 


ggatgggctt 


cccccagggc 


3420 


ttccctccag 


tacagtctcc 


cccacagcag 


gttccattcc 


ctcacaatgg 


ccccagtggg 


3480 


gggcagggca 


gcttcccagg 


agggatgggt 


ttcccaggag 


aaggccccct 


tggccgcccc 


3540 


agcaacctgc 


cccaaagttc 


agcagatgca 


gcactttgca 


agcctggagg 


ccccgggggt 


3600 


cctgactcct 


tcactgtcct 


ggggaacagc 


atgccttcgg 


tgtttacaga 


cccagatctg 


3660 


caggaggtca 


tccgacctgg 


agccaccgga 


atacctgagt 


ttgatctatc 


ccgcattatt 


3720 


ccatctgaga 


agcccagcca 


gacgctgcaa 


tatttccctc 


gaggggaagt 


tccaggccgt 


3780 


aaacagcccc 


agggtcctgg 


acctgggttt 


tcacacatgc 


aggggatgat 


gggcgaacaa 


3840 


gcccccagaa 


tgggactagc 


attacctggc 


atgggaggtc 


cagggccagt 


gggaactccg 


3900 


gacatccctc 


ttggtacagc 


tccatccatg 


ccaggccaca 


accccatgag 


accaccagcc 


3960 


tttctccaac 


aaggcatgat 


gggacctcac 


catcggatga 


tgtcaccagc 


acaatctaca 


4020 


atgcccggcc 


agcccaccct 


gatgagcaat 


ccagctgctg 


ccgtgggcat 


gattcctggc 


4080 


aaggatcggg 


ggcctgccgg 


gctctacacc 


caccctgggc 


ctgtgggctc 


tccaggcatg 


4140 


atgatgtcca 


tgcagggcat 


gatgggaccc 


caacagaaca 


tcatgatccc 


cccacagatg 


4200 


aggccccggg 


gcatggctgc 


tgacgtgggc 


atgggtggat 


ttagccaagg 


acctggcaac 


4260 


ccaggaaaca 


tgatgtttta 


a 








4281 



<210> 15 
<211> 1426 
<212> PRT 

<213> Human lgs/bcl9 
<400> 15 

Met His Ser Ser Asn Pro Lys Val Arg Ser Ser Pro Ser Gly Asn Thr 
15 10 15 

Gin Ser Ser Pro Lys Ser Lys Gin Glu Val Met Val Arg Pro Pro Thr 
20 25 30 

Val Met Ser Pro Ser Gly Asn Pro Gin Leu Asp Ser Lys Phe Ser Asn 
35 40 45 



Gin Gly Lys Gin Gly Gly Ser Ala Ser Gin Ser Gin Pro Ser Pro Cys 
50 55 60 



Asp Ser Lys Ser Gly Gly His Thr Pro Lys Ala Leu Pro Gly Pro Gly 
65 70 75 80 



Gly Ser Met Gly Leu Lys Asn Gly Ala Gly Asn Gly Ala Lys Gly Lys 
85 90 95 



Gly Lys Arg Glu Arg Ser lie Ser Ala Asp Ser Phe Asp Gin Arg Asp 
100 105 110 



Pro Gly Thr Pro Asn Asp Asp Ser Asp lie Lys Glu Cys Asn Ser Ala 
115 120 125 



Asp His lie Lys Ser Gin Asp Ser Gin His Thr Pro His Ser Met Thr 
130 135 140 



Pro Ser Asn Ala Thr Ala Pro Arg Ser Ser Thr Pro Ser His Gly Gin 
145 150 155 160 



Thr Thr Ala Thr Glu Pro Thr Pro Ala Gin Lys Thr Pro Ala Lys Val 
165 170 175 



Val Tyr Val Phe Ser Thr Glu Met Ala Asn Lys Ala Ala Glu Ala Val 
180 185 190 



Leu Lys Gly Gin Val Glu Thr lie Val Ser Phe His lie Gin Asn lie 
195 200 205 



Ser Asn Asn Lys Thr Glu Arg Ser Thr Ala Pro Leu Asn Thr Gin lie 
210 215 220 



Ser Ala Leu Arg Asn Asp Pro Lys Pro Leu Pro Gin Gin Pro Pro Ala 
225 230 235 240 



Pro Ala Asn Gin Asp Gin Asn Ser Ser Gin Asn Thr Arg Leu Gin Pro 
245 250 255 



Thr Pro Pro lie Pro Ala Pro Ala Pro Lys Pro Ala Ala Pro Pro Arg 
260 265 270 



Pro Leu Asp Arg Glu Ser Pro Gly Val Glu Asn Lys Leu lie Pro Ser 



275 



280 



285 



Val Gly Ser Pro Ala Ser Ser Thr Pro Leu Pro Pro Asp Gly Thr Gly 
290 295 300 



Pro Asn Ser Thr Pro Asn Asn Arg Ala Val Thr Pro Val Ser Gin Gly 
305 310 315 320 



Ser Asn Ser Ser Ser Ala Asp Pro Lys Ala Pro Pro Pro Pro Pro Val 
325 330 335 



Ser Ser Gly Glu Pro Pro Thr Leu Gly Glu Asn Pro Asp Gly Leu Ser 
340 345 350 



Gin Glu Gin Leu Glu His Arg Glu Arg Ser Leu Gin Thr Leu Arg Asp 
355 360 365 



lie Gin Arg Met Leu Phe Pro Asp Glu Lys Glu Phe Thr Gly Ala Gin 
370 375 380 



Ser Gly Gly Pro Gin Gin Asn Pro Gly Val Leu Asp Gly Pro Gin Lys 
385 390 395 400 



Lys Pro Glu Gly Pro lie Gin Ala Met Met Ala Gin Ser Gin Ser Leu 
405 410 415 



Gly Lys Gly Pro Gly Pro Arg Thr Asp Val Gly Ala Pro Phe Gly Pro 
420 425 430 



Gin Gly His Arg Asp Val Pro Phe Ser Pro Asp Glu Met Val Pro Pro 
435 440 445 



Ser Met Asn Ser Gin Ser Gly Thr lie Gly Pro Asp His Leu Asp His 
450 455 460 



Met Thr Pro Glu Gin lie Ala Trp Leu Lys Leu Gin Gin Glu Phe Tyr 
465 470 475 480 



Glu Glu Lys Arg Arg Lys Gin Glu Gin Val Val Val Gin Gin Cys Ser 
485 490 495 



Leu Gin Asp Met Met Val His Gin His Gly Pro Arg Gly Val Val Arg 
500 505 510 



Gly Pro Pro Pro Pro Tyr Gin Met Thr Pro Ser Glu Gly Trp Ala Pro 
515 520 525 



Gly Gly Thr Glu Pro Phe Ser Asp Gly lie Asn Met Pro His Ser Leu 
530 535 540 



Pro Pro Arg Gly Met Ala Pro His Pro Asn Met Pro Gly Ser Gin Met 
545 550 555 560 



Arg Leu Pro Gly Phe Ala Gly Met lie Asn Ser Glu Met Glu Gly Pro 
565 570 575 



Asn Val Pro Asn Pro Ala Ser Arg Pro Gly Leu Ser Gly Val Ser Trp 
580 585 590 



Pro Asp Asp Val Pro Lys lie Pro Asp Gly Arg Asn Phe Pro Pro Gly 
595 600 605 



Gin Gly lie Phe Ser Gly Pro Gly Arg Gly Glu Arg Phe Pro Asn Pro 
610 615 620 



Gin Gly Leu Ser Glu Glu Met Phe Gin Gin Gin Leu Ala Glu Lys Gin 
625 630 635 640 



Leu Gly Leu Pro Pro Gly Met Ala Met Glu Gly lie Arg Pro Ser Met 
645 650 655 



Glu Met Asn Arg Met He Pro Gly Ser Gin Arg His Met Glu Pro Gly 
660 665 670 



Asn Asn Pro He Phe Pro Arg He Pro Val Glu Gly Pro Leu Ser Pro 
675 680 685 



Ser Arg Gly Asp Phe Pro Lys Gly He Pro Pro Gin Met Gly Pro Gly 
690 695 700 



Arg Glu Leu Glu Phe Gly Met Val Pro Ser Gly Met Lys Gly Asp Val 
705 710 715 720 



Asn Leu Asn Val Asn Met Gly Ser Asn Ser Gin Met He Pro Gin Lys 
725 730 735 



Met Arg Glu Ala Gly Ala Gly Pro Glu Glu Met Leu Lys Leu Arg Pro 
740 745 750 



Gly Gly Ser Asp Met Leu Pro Ala Gin Gin Lys Met Val Pro Leu Pro 
755 760 765 



Phe Gly Glu His Pro Gin Gin Glu Tyr Gly Met Gly Pro Arg Pro Phe 
770 775 780 



Leu Pro Met Ser Gin Gly Pro Gly Ser Asn Ser Gly Leu Arg Asn Leu 
785 790 795 800 



Arg Glu Pro lie Gly Pro Asp Gin Arg Thr Asn Ser Arg Leu Ser His 
805 810 815 



Met Pro Pro Leu Pro Leu Asn Pro Ser Ser Asn Pro Thr Ser Leu Asn 
820 825 830 



Thr Ala Pro Pro Val Gin Arg Gly Leu Gly Arg Lys Pro Leu Asp lie 
835 840 845 



Ser Val Ala Gly Ser Gin Val His Ser Pro Gly lie Asn Pro Leu Lys 
850 855 860 



Ser Pro Thr Met His Gin Val Gin Ser Pro Met Leu Gly Ser Pro Ser 
865 870 875 880 



Gly Asn Leu Lys Ser Pro Gin Thr Pro Ser Gin Leu Ala Gly Met Leu 
885 890 895 



Ala Gly Pro Ala Ala Ala Ala Ser lie Lys Ser Pro Pro Val Leu Gly 
900 905 910 



Ser Ala Ala Ala Ser Pro Val His Leu Lys Ser Pro Ser Leu Pro Ala 
915 920 925 



Pro Ser Pro Gly Trp Thr Ser Ser Pro Lys Pro Pro Leu Gin Ser Pro 
930 935 940 



Gly lie Pro Pro Asn His Lys Ala Pro Leu Thr Met Ala Ser Pro Ala 
945 950 955 960 



Met Leu Gly Asn Val Glu Ser Gly Gly Pro Pro Pro Pro Thr Ala Ser 
965 970 975 



Gin Pro Ala Ser Val Asn lie Pro Gly Ser Leu Pro Ser Ser Thr Pro 
980 985 990 



Tyr Thr Met Pro Pro Glu Pro Thr Leu Ser Gin Asn Pro Leu Ser ll« 
995 1000 1005 



Met Met Ser Arg Met Ser Lys Phe Ala Met Pro Ser Ser Thr Pro 
1010 1015 1020 



Leu Tyr His Asp Ala lie Lys Thr Val Ala Ser Ser Asp Asp Asp 
1025 1030 1035 



Ser Pro Pro Ala Arg Ser Pro Asn Leu Pro Ser Met Asn Asn Met 
1040 1045 1050 



Pro Gly Met Gly lie Asn Thr Gin Asn Pro Arg lie Ser Gly Pro 
1055 1060 1065 



Asn Pro Val Val Pro Met Pro Thr Leu Ser Pro Met Gly Met Thr 
1070 1075 1080 



Gin Pro Leu Ser His Ser Asn Gin Met Pro Ser Pro Asn Ala Val 
1085 1090 1095 



Gly Pro Asn lie Pro Pro His Gly Val Pro Met Gly Pro Gly Leu 
1100 1105 1110 



Met Ser His Asn Pro lie Met Gly His Gly Ser Gin Glu Pro Pro 
1115 1120 1125 



Met Val Pro Gin Gly Arg Met Gly Phe Pro Gin Gly Phe Pro Pro 
1130 1135 1140 



Val Gin Ser Pro Pro Gin Gin Val Pro Phe Pro His Asn Gly Pro 
1145 1150 1155 



Ser Gly Gly Gin Gly Ser Phe Pro Gly Gly Met Gly Phe Pro Gly 
1160 1165 1170 



Glu Gly Pro Leu Gly Arg Pro Ser Asn Leu Pro Gin Ser Ser Ala 



1175 



1180 



1185 



Asp Ala Ala Leu Cys Lys Pro Gly Gly Pro Gly Gly Pro Asp Ser 
1190 1195 1200 



Phe Thr Val Leu Gly Asn Ser Met Pro Ser Val Phe Thr Asp Pro 
1205 1210 1215 



Asp Leu Gin Glu Val lie Arg Pro Gly Ala Thr Gly lie Pro Glu 
1220 1225 1230 



Phe Asp Leu Ser Arg lie lie Pro Ser Glu Lys Pro Ser Gin Thr 
1235 1240 1245 



Leu Gin Tyr Phe Pro Arg Gly Glu Val Pro Gly Arg Lys Gin Pro 
1250 1255 1260 



Gin Gly Pro Gly Pro Gly Phe Ser His Met Gin Gly Met Met Gly 
1265 1270 1275 



Glu Gin Ala Pro Arg Met Gly Leu Ala Leu Pro Gly Met Gly Gly 
1280 1285 1290 



Pro Gly Pro Val Gly Thr Pro Asp lie Pro Leu Gly Thr Ala Pro 
1295 1300 1305 



Ser Met Pro Gly His Asn Pro Met Arg Pro Pro Ala Phe Leu Gin 
1310 1315 1320 



Gin Gly Met Met Gly Pro His His Arg Met Met Ser Pro Ala Gin 
1325 1330 1335 



Ser Thr Met Pro Gly Gin Pro Thr Leu Met Ser Asn Pro Ala Ala 
1340 1345 1350 



Ala Val Gly Met lie Pro Gly Lys Asp Arg Gly Pro Ala Gly Leu 
1355 1360 1365 



Tyr Thr His Pro Gly Pro Val Gly Ser Pro Gly Met Met Met Ser 
1370 1375 1380 



Met Gin Gly Met Met Gly Pro Gin Gin Asn lie Met lie Pro Pro 
1385 1390 1395 



Gin Met Arg Pro Arg Gly Met Ala Ala Asp Val Gly Met Gly Gly 
1400 1405 1410 



Phe Ser Gin Gly Pro Gly Asn Pro Gly Asn Met Met Phe 



<210> 16 

<211> 3948 

<212> DNA 

<213> Human lgs-1 

<400> 16 

atggcctgct tcccatcccc tgctgccatc tcctgcaccc ttagggcaca gtgggcatct 60 

cgggagctgc tcagcggaca gactagggtt acccccaccc caggaggaga gaagctccag 120 

ggagcccgcc gctgtccccc gcggtcattg ccccctgccc cagccaagcc aatgcaccca 180 

gaaaataaat tgaccaatca tggcaagaca gggaatggcg gggcccaatc tcagcaccag 240 

aatgtgaacc aaggacccac ctgcaacgtg ggctcgaagg gcgtgggggc ggggaaccat 300 

ggggccaagg ccaaccagat ctcgcctagc aactcaagtc tgaagaaccc ccaggcaggg 360 

gtgccccctt tcagctcgct caagggcaag gtgaagaggg accggagtgt gtctgtggac 420 

tctggagagc agcgagaggc tgggacccca tccctggatt cagaggccaa agaggtggcg 480 

ccgcggagta agcggcgctg tgtgctggag cggaagcagc cgtacagtgg ggacgaatgg 540 

tgctctggac cggacagtga ggaggacgac aagcccattg gggccaccca caaagctgct 600 

ttcaaagaag acggctttca ggacaaggca tcacacttct tctccagcac gtacagtcct 660 

gaaacctcca ggaggaagct gccccaagcc ccgaaggctt ccttcctggg gcagcagggc 720 

cgagtcattt ggaaacctct ctcggaggag ctccgtgatc aaggtgcaga tgcggcaggt 780 

gggccggcct caatcatgtc tccaatcgcg acggtgaatg cgagtggctt gtccaaagag 840 

cagctggagc atcgggaacg gtccctccag acgctgcgag acattgagcg actgctgctc 900 

cgcagcggag agactgagcc cttcctcaag ggggccccca ggaggagcgg cgggctgaag 960 

aaatatgagg aacccttgca gtccatgatt tcacagacac agagcctagg gggccccccg 1020 

ctggagcatg aagtgcctgg gcaccccccg ggtggggaca tggggcagca gatgaacatg 1080 

atgatacaga ggctgggcca ggacagcctc acgcctgagc aggtggcctg gcgcaagctg 1140 

caggaggagt actacgaaga gaaacggcgg aaagaggaac agattgggct gcatgggagc 1200 

cgtcctctgc aggacatgat gggcatgggg ggcatgatgg tgagggggcc cccgcctcct 1260 



taccacagca 


agcctgggga 


tcagtggcca 


cctggaatgg 


gtgcgcagct 


gcgggggccc 


1320 


atggatgttc 


aagatcccat 


gcagctccgg 


ggcggacctc 


cctttcctgg 


gccccgtttc 


1380 


ccaggcaacc 


agatacaacg 


ggtacctggg 


tttgggggca 


tgcagagtat 


gcccatggag 


1440 


gtgcccatga 


atgccatgca 


gaggcccgtg 


agaccaggca 


tgggctggac 


cgaagacttg 


1500 


ccccctatgg 


ggggacccag 


caattttgcc 


cagaacacca 


tgccctaccc 


aggtgggcag 


1560 


ggtgaggcgg 


agcgattcat 


gactccccgg 


gtccgtgagg 


agctgctgcg 


gcaccagctg 


1620 


ctggagaagc 


ggtcgatggg 


catgcagcgc 


cccctgggca 


tggcaggcag 


tggcatggga 


1680 


cagagcatgg 


agatggagcg 


gatgatgcag 


gcgcaccgac 


agatggatcc 


tgccatgttt 


1740 


cccgggcaga 


tggctggtgg 


tgagggcctg 


gcgggcactc 


ccatgggcat 


ggagtttggt 


1800 


ggaggccggg 


gcctcctgag 


ccctcccatg 


gggcagtctg 


ggctgaggga 


ggtggaccca 


1860 


cccatggggc 


caggcaacct 


caacatgaac 


atgaatgtca 


acatgaacat 


gaacatgaac 


1920 


ctgaacgtgc 


agatgacccc 


gcagcagcag 


atgctgatgt 


cgcagaagat 


gcggggccct 


1980 


ggggacttga 


tggggcccca 


gggcctcagt 


cctgaggaga 


tggcccgggt 


tcgggcccag 


2040 


aacagcagtg 


gcatggtgcc 


cttgccttct 


gccaacccgc 


caggacctct 


caagtcgccc 


2100 


caggtcctcg 


gctcctccct 


cagtgtccgt 


tcacccactg 


gctcgcccag 


caggctcaag 


2160 


tctccttcca 


tggcggtgcc 


ttctccaggc 


tgggttgcct 


cacccaagac 


ggccatgccc 


2220 


agcccggggg 


tctcccagaa 


caagcagccg 


cctctcaaca 


tgaactcttc 


caccaccctg 


2280 


agcaacatgg 


aacaggaccc 


cacaccttcc 


cagaaccccc 


tgtcactgat 


gatgacccag 


2340 


atgtccaagt 


acgccatgcc 


cagctccacc 


ccgctctacc 


acaatgccat 


caagaccatc 


2400 


gccacctcag 


acgacgagct 


gctgcccgac 


cggcccctgc 


tgcccccccc 


accaccaccg 


2460 


cagggctccg 


ggccaggtgg 


ccccgactcc 


ctgaatgccc 


cctgtggccc 


agtgcccagc 


2520 


tcctcccaga 


tgatgccctt 


cccccctcgg 


ctgcagcagc 


cccatggtgc 


catggccccc 


2580 


actgggggtg 


ggggcggggg 


gcctggcctg 


cagcagcact 


acccgtcagg 


catggccctg 


2640 


cctcccgagg 


acctgcccaa 


ccagccgcca 


ggccccatgc 


ctccccagca 


gcacctgatg 


2700 


ggcaaagcca 


tggctgggcg 


catgggcgac 


gcatacccac 


cgggtgtgct 


ccctggggtg 


2760 


gcatcagtgc 


tgaacgaccc 


cgagctgagc 


gaggtgatcc 


ggcccacccc 


aacggggatc 


2820 


cccgagttcg 


acttgtcgag 


gatcatcccc 


tctgagaagc 


caagcagcac 


cctccagtac 


2880 


ttccccaaga 


gcgagaacca 


gccccccaag 


gctcagcccc 


ctaatctgca 


tctcatgaac 


2940 



ctgcagaaca 


tgatggcgga 


gcagactccc 


tctcggcctc 


ccaacctccc 


aggccagcag 


3000 


ggcgatcggc 


cgctggtggt 


ggtgataccg 


ggtacccggg 


ctatggcgcc 


ggcgcagcgc 


3060 


tgccctctgt 


gccgccagac 


cttcttctgt 


ggtcgcgggc 


acgtttacag 


ccgcaagcac 


3120 


cagcggcagc 


tgaaggaggc 


tttggagagg 


ctcctgcccc 


aggtggaggc 


ggcccgcaag 


3180 


gccatccgcg 


ccgctcaggt 


ggagcgctat 


gtgcccgaac 


acgagcgatg 


ctgctggtgc 


3240 


ctgtgctgcg 


gctgtgaggt 


gcgggaacac 


ctgagccatg 


gaaacctgac 


ggtgctgtac 


3300 


ggggggctgc 


tggagcatct 


ggccagccca 


gagcacaaga 


aagcaaccaa 


caaattctgg 


3360 


tgggagaaca 


aagctgaggt 


ccagatgaaa 


gagaagtttc 


tggtcactcc 


ccaggattat 


3420 


gcgcgattca 


agaaatccat 


ggtgaaaggt 


ttggattcct 


atgaagaaaa 


ggaggataaa 


3480 


gtgatcaagg 


agatggcagc 


tcagatccgt 


gaggtggagc 


agagccgaca 


ggaggtggtt 


3540 


cggtctgtct 


tagagacagg 


tcccccaaga 


tacgccctca 


cagtccggtc 


ccccgccgtc 


3600 


ctctcccggc 


gcacgctcaa 


gtccggtgcc 


ttccccccgc 


agacccccga 


ggcgcaccct 


3660 


caagctcggt 


gcctctgcgc 


cccccgcagg 


ggcgccctca 


agcctgagcc 


ccccgggcgc 


3720 


accctcaagc 


tcggtgtacc 


cccccatacc 


acccgcaagg 


cgcgccctca 


tgccgcgaag 


~i i o r\ 
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acttcgcccc 


gcccaaggtg 


cacccgtcaa 


gccccgaata 


aaacccagtc 


actccaactt 


3840 


gcaggcaaag 


ctagaaaaac 


tgcgctgcat 


ttgcaaacaa 


aagctcttgt 


tggcgatgac 


3900 


gatactgttt 


tgggtgtgaa 


actgtcaatt 


gctaactacg 


atctgtga 




3948 



<210> 17 

<211> 1115 

<212> PRT 

<213> Human lgs-1 

<400> 17 

Phe Lys Glu Asp Gly Phe Gin Asp Lys Ala Ser His Phe Phe Ser Ser 
15 10 15 

Thr Tyr Ser Pro Glu Thr Ser Arg Arg Lys Leu Pro Gin Ala Pro Lys 
20 25 30 

Ala Ser Phe Leu Gly Gin Gin Gly Arg Val lie Trp Lys Pro Leu Ser 
35 40 45 

Glu Glu Leu Arg Asp Gin Gly Ala Asp Ala Ala Gly Gly Pro Ala Ser 
50 55 60 



lie Met Ser Pro lie Ala Thr Val Asn Ala Ser Gly Leu Ser Lys Glu 
65 70 75 80 



Gin Leu Glu His Arg Glu Arg Ser Leu Gin Thr Leu Arg Asp lie Glu 
85 90 95 



Arg Leu Leu Leu Arg Ser Gly Glu Thr Glu Pro Phe Leu Lys Gly Ala 
100 105 110 



Pro Arg Arg Ser Gly Gly Leu Lys Lys Tyr Glu Glu Pro Leu Gin Ser 
115 120 125 



Met lie Ser Gin Thr Gin Ser Leu Gly Gly Pro Pro Leu Glu His Glu 
130 135 140 



Val Pro Gly His Pro Pro Gly Gly Asp Met Gly Gin Gin Met Asn Met 
145 150 155 160 



Met lie Gin Arg Leu Gly Gin Asp Ser Leu Thr Pro Glu Gin Val Ala 
165 170 175 



Trp Arg Lys Leu Gin Glu Glu Tyr Tyr Glu Glu Lys Arg Arg Lys Glu 
180 185 190 



Glu Gin lie Gly Leu His Gly Ser Arg Pro Leu Gin Asp Met Met Gly 
195 200 205 



Met Gly Gly Met Met Val Arg Gly Pro Pro Pro Pro Tyr His Ser Lys 
210 215 220 



Pro Gly Asp Gin Trp Pro Pro Gly Met Gly Ala Gin Leu Arg Gly Pro 
225 230 235 240 



Met Asp Val Gin Asp Pro Met Gin Leu Arg Gly Gly Pro Pro Phe Pro 
245 250 255 



Gly Pro Arg Phe Pro Gly Asn Gin lie Gin Arg Val Pro Gly Phe Gly 
260 265 270 



Gly Met Gin Ser Met Pro Met Glu Val Pro Met Asn Ala Met Gin Arg 
275 280 285 



Pro Val Arg Pro Gly Met Gly Trp Thr Glu Asp Leu Pro Pro Met Gly 
290 295 300 



Gly Pro Ser Asn Phe Ala Gin Asn Thr Met Pro Tyr Pro Gly Gly Gin 
305 310 315 320 



Gly Glu Ala Glu Arg Phe Met Thr Pro Arg Val Arg Glu Glu Leu Leu 
325 330 335 



Arg His Gin Leu Leu Glu Lys Arg Ser Met Gly Met Gin Arg Pro Leu 
340 345 350 



Gly Met Ala Gly Ser Gly Met Gly Gin Ser Met Glu Met Glu Arg Met 
355 360 365 



Met Gin Ala His Arg Gin Met Asp Pro Ala Met Phe Pro Gly Gin Met 
370 375 380 



Ala Gly Gly Glu Gly Leu Ala Gly Thr Pro Met Gly Met Glu Phe Gly 
385 390 395 400 



Gly Gly Arg Gly Leu Leu Ser Pro Pro Met Gly Gin Ser Gly Leu Arg 
405 410 415 



Glu Val Asp Pro Pro Met Gly Pro Gly Asn Leu Asn Met Asn Met Asn 
420 425 430 



Val Asn Met Asn Met Asn Met Asn Leu Asn Val Gin Met Thr Pro Gin 
435 440 445 



Gin Gin Met Leu Met Ser Gin Lys Met Arg Gly Pro Gly Asp Leu Met 
450 455 460 



Gly Pro Gin Gly Leu Ser Pro Glu Glu Met Ala Arg Val Arg Ala Gin 
465 470 475 480 



Asn Ser Ser Gly Met Val Pro Leu Pro Ser Ala Asn Pro Pro Gly Pro 
485 490 495 



Leu Lys Ser Pro Gin Val Leu Gly Ser Ser Leu Ser Val Arg Ser Pro 
500 505 510 



Thr Gly Ser Pro Ser Arg Leu Lys Ser Pro Ser Met Ala Val Pro Ser 



515 



520 



525 



Pro Gly Trp Val Ala Ser Pro Lys Thr Ala Met Pro Ser Pro Gly Val 
530 535 540 



Ser Gin Asn Lys Gin Pro Pro Leu Asn Met Asn Ser Ser Thr Thr Leu 
545 550 555 560 



Ser Asn Met Glu Gin Asp Pro Thr Pro Ser Gin Asn Pro Leu Ser Leu 
565 570 575 



Met Met Thr Gin Met Ser Lys Tyr Ala Met Pro Ser Ser Thr Pro Leu 
580 585 590 



Tyr His Asn Ala lie Lys Thr lie Ala Thr Ser Asp Asp Glu Leu Leu 
595 600 605 



Pro Asp Arg Pro Leu Leu Pro Pro Pro Pro Pro Pro Gin Gly Ser Gly 
610 615 620 



Pro Gly Gly Pro Asp Ser Leu Asn Ala Pro Cys Gly Pro Val Pro Ser 
625 630 635 640 



Ser Ser Gin Met Met Pro Phe Pro Pro Arg Leu Gin Gin Pro His Gly 
645 650 655 



Ala Met Ala Pro Thr Gly Gly Gly Gly Gly Gly Pro Gly Leu Gin Gin 
660 665 670 



His Tyr Pro Ser Gly Met Ala Leu Pro Pro Glu Asp Leu Pro Asn Gin 
675 680 685 



Pro Pro Gly Pro Met Pro Pro Gin Gin His Leu Met Gly Lys Ala Met 
690 695 700 



Ala Gly Arg Met Gly Asp Ala Tyr Pro Pro Gly Val Leu Pro Gly Val 
705 710 715 720 



Ala Ser Val Leu Asn Asp Pro Glu Leu Ser Glu Val lie Arg Pro Thr 
725 730 735 



Pro Thr Gly lie Pro Glu Phe Asp Leu Ser Arg lie lie Pro Ser Glu 
740 745 750 



Lys Pro Ser Ser Thr Leu Gin Tyr Phe Pro Lys Ser Glu Asn Gin Pro 
755 760 765 



Pro Lys Ala Gin Pro Pro Asn Leu His Leu Met Asn Leu Gin Asn Met 
770 775 780 



Met Ala Glu Gin Thr Pro Ser Arg Pro Pro Asn Leu Pro Gly Gin Gin 
785 790 795 800 



Gly Asp Arg Pro Leu Val Val Val lie Pro Gly Thr Arg Ala Met Ala 
805 810 815 



Pro Ala Gin Arg Cys Pro Leu Cys Arg Gin Thr Phe Phe Cys Gly Arg 
820 825 830 



Gly His Val Tyr Ser Arg Lys His Gin Arg Gin Leu Lys Glu Ala Leu 
835 840 845 



Glu Arg Leu Leu Pro Gin Val Glu Ala Ala Arg Lys Ala lie Arg Ala 
850 855 860 



Ala Gin Val Glu Arg Tyr Val Pro Glu His Glu Arg Cys Cys Trp Cys 
865 870 875 880 



Leu Cys Cys Gly Cys Glu Val Arg Glu His Leu Ser His Gly Asn Leu 
885 890 895 



Thr Val Leu Tyr Gly Gly Leu Leu Glu His Leu Ala Ser Pro Glu His 
900 905 910 



Lys Lys Ala Thr Asn Lys Phe Trp Trp Glu Asn Lys Ala Glu Val Gin 
915 920 925 



Met Lys Glu Lys Phe Leu Val Thr Pro Gin Asp Tyr Ala Arg Phe Lys 
930 935 940 



Lys Ser Met Val Lys Gly Leu Asp Ser Tyr Glu Glu Lys Glu Asp Lys 
945 950 955 960 



Val lie Lys Glu Met Ala Ala Gin lie Arg Glu Val Glu Gin Ser Arg 
965 970 975 



Gin Glu Val Val Arg Ser Val Leu Glu Thr Gly Pro Pro Arg Tyr Ala 
980 985 990 



Leu Thr Val Arg Ser Pro Ala Val Leu Ser Arg Arg Thr Leu Lys Ser 
995 1000 1005 



Gly Ala Phe Pro Pro Gin Thr Pro Glu Ala His Pro Gin Ala Arg 
1010 1015 1020 



Cys Leu Cys Ala Pro Arg Arg Gly Ala Leu Lys Pro Glu Pro Pro 
1025 1030 1035 



Gly Arg Thr Leu Lys Leu Gly Val Pro Pro His Thr Thr Arg Lys 
1040 1045 1050 



Ala Arg Pro His Ala Ala Lys Thr Ser Pro Arg Pro Arg Cys Thr 
1055 1060 1065 



Arg Gin Ala Pro Asn Lys Thr Gin Ser Leu Gin Leu Ala Gly Lys 
1070 1075 1080 



Ala Arg Lys Thr Ala Leu His Leu Gin Thr Lys Ala Leu Val Gly 
1085 1090 1095 



Asp Asp Asp Thr Val Leu Gly Val Lys Leu Ser lie Ala Asn Tyr 
1100 1105 1110 



Asp Leu 
1115 
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taatacgact cactataggg agaccacttc catgctcatt tcgtcatta 49 
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taatacgact cactataggg agaccactag gatctctcga caacaatg 48 
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<212> DNA 
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<221> misc_structure 

<222> (1) . - (49) 

<223> F Primer 
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taatacgact cactataggg agaccacaca agaccaagtg gacgatatg 49 
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<212> DNA 
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<220> 

<221> misc_structure 

<222> (1) . - (48) 

<223> R Primer 
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taatacgact cactataggg agaccacaat ttgcaagcaa tctgtgac 48 
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<212> DNA 
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taatacgact cactataggg agaccac 



27 



